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We study a two-level system controlled in a discrete feedback loop, modeling both the system 
and the controller in terms of stochastic Markov processes. We find that the extracted work, which 
is known to be bounded from above by the mutual information acquired during measurement, 
has to be compensated by an additional energy supply during the measurement process itself, 
which is bounded by the same mutual information from below. Our results confirm that the total 
cost of operating an information engine is in full agreement with the conventional second law of 
thermodynamics. We also consider the efficiency of the information engine as function of the cycle 
time and discuss the operating condition for maximal power generation. Moreover, we find that the 
entropy production of our information engine is maximal for maximal efficiency, in sharp contrast 
to conventional reversible heat engines. 


I. INTRODUCTION 


In 1867, J.C. Maxwell created a thought experiment 
to demonstrate a possible violation of the second law of 
thermodynamics: A thermally isolated container with a 
gas is divided into two parts and a fictitious demon opens 
or closes a door between the two parts depending on the 
velocity of approaching particles, creating an increase of 
the temperature in one of the compartments [T]. 

Smoluchowski was the first to provide an explanation 
why Maxwell’s demon does not work. To this end, he 
modeled the demon mechanically by a trapdoor com¬ 
bined with a gentle spring. The trapdoor acts as a valve 
in such a way that fast particles coming from one side 
can open the door while slow ones cannot, leading to 
a pressure difference. However, taking the full dynam¬ 
ics of the apparatus into account, Smoluchowski could 
demonstrate that the energy of the spring system itself 
equilibrates at such a high energy that it opens and closes 
essentially randomly, leading to the same pressure differ¬ 
ence as if the trapdoor was always open. 

In 1928, L. Szilard refined the concept of Maxwell’s 
demon, suggesting what is known today as the Szilard 
engine [3] . Starting point is a box that contains only one 
particle (see Fig. [^. If a wall is inserted in the middle, 
the particle will be in one of the two parts. Expanding 
the volume isothermally to its original size by moving the 
shutter into the empty half of the box one can extract the 
work W = kBTln2. However, this requires to know in 
which compartment the particle actually is, demonstrat¬ 
ing that the possession of information can be converted 
into physical work. Thus, in order to keep a Szilard en¬ 
gine running, a closed loop of measurement and feedback 
is needed. Very recently, such a feedback scheme could 
be realized experimentally for the first time @]. 

Since the feedback mechanism in the Szilard engine 
was perceived as information processing at zero cost, it 
seemed to produce usable work from nothing, violating 
the second law. However, as shown by Landauer [5] in 



FIG. 1: Szilard engine. In a box with a single molecule, a 
piston is inserted in the middle. Depending on the location of 
the particle, the piston is pushed in one direction, allowing ns 
to extract work. However, placing the load correctly requires 
1 bit of information about the position of the particle. At the 
end, the piston is removed at zero cost. 


1961, any irreversible logical operation requires to apply 
a well-defined minimum of work. In case of the Szilard 
engine, the demon has to store the information about the 
particle position in a single bit. According to Landauer’s 
principle, resetting this bit requires to convert a work of 
ksT In 2 into heat. As shown by Bennett [5] , who realized 
measurement and feedback as reversible processes, this 
extra work for resetting restores the second law. 

Recently Maxwell’s demon attracted renewed atten¬ 
tion as it was shown that a system in a feedback loop 
obeys an integral fluctuation theorem (IFT) of the form 

= 1 , ( 1 ) 

implying l3{Wex) < (A/), where Wex is the extracted 
work during feedback, (3 is the inverse temperature, and 
A/ is the gained mutual information between system and 
demon during the measurement [7H5]. This fluctuation 
theorem implies that in each cycle of the engine the ex¬ 
tracted work is limited by the gained mutual information, 
- a highly plausible result that nicely demonstrates the 
equivalence of thermodynamic work and information. 
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Subsequently this remarkable result was made more 
specific in various ways. For example, it was shown that 
one can construct feedback schemes that satisfy the in¬ 
equality sharply nnun]. Moreover, the IFT was gener¬ 
alized to schemes with finite-time relaxation [ima and 
continuous feedback schemes [14] . Very recently, the gen¬ 
eralized IFT could also be confirmed experimentally m- 

The arising problem with these generalized Jarzynski 
equalities is that the tightness of the bound seems to de¬ 
pend on the specihc feedback scheme such as discrete and 
continuous feedback as well as memory tape models. This 
led Barato et al. to look for a unifying master IFT [m- 
[19]. To achieve this, they duplicated the configuration 
space, modeling bit flips of the memory by transitions 
between the two replicas. Doing so they followed Smolu- 
chowski’s original idea of modeling the whole feedback 
loop as a physical device, defined as a stochastic Markov 
process. 

In this paper, we follow these lines of thought, be¬ 
ing interested in the total cost of operating an informa¬ 
tion engine as described above. Instead of duplicating 
the configuration space, we devise physically realizable 
stochastic processes of the joint system not only for re¬ 
laxation, but also for the measurement process. We show 
that the generalized IFT for the relaxation of system is 
always accompanied by an opposite IFT during the mea¬ 
surement carried out by the demon, restoring the second 
law for the joint system. This implies that a certain min¬ 
imal amount of work has to be done on the memory, in 
accordance with Landauer’s principle. The calculation 
of the total cost enables us to derive the efficiency of the 
information engine as a function of its cycle time. We 
also discuss the optimal cycle time and corresponding 
efficiency, at which the extracted power is maximized. 


II. DEFINITION OF A MINIMAL MODEL 



if possible extract AE 


FIG. 2: Schematic drawing of the information engine studied 
in the present work. The configuration of a thermally relaxing 
two-state system is measured by a controller and stored in a 
one-bit memory. If the system is found in the upper level, 
the controller induces a transition back to the ground state, 
extracting the energy AE. 


I. Relaxation 


During relaxation, the system flips randomly according 
to the rules 

0 —?► 1 with rate 
I —)■ 0 with rate fc_ . 

For convenience, we express these rates in terms of 

k = + k- and q = — (2) 

k 

with g < i. After infinite time, the system eventually 
reaches an equilibrium state with the stationary proba¬ 
bility distribution = 1 — Pstat = 9- This means that 
the energy difference between the two levels is given by 

AP = ^-Mn^ (g=l-g). (3) 

Let us now describe the relaxation process in the compos¬ 
ite configuration space of system and memory. Through¬ 
out this paper, we will use a canonical configuration basis 
ordered by 


In what follows we consider a system with two different 
energy levels separated by AE. The system is coupled to 
a heat bath with inverse temperature /?. The controller 
(demon) is implemented as a I-bit memory. We devise 
a discrete feedback scheme evolving in three steps (see 
Fig.[|). First the system relaxes thermally without ex¬ 
ternal influence. Then the actual energy level, denoted 
by 0 and I, is copied to the memory of the controller. 
Finally, if the memory bit is 1, the controller induces 
a flip of the system 1 —> 0, extracting the energy AE, 
otherwise it does nothing. 

Following Barato and Seifert [19], we are aiming 
to model these steps by stochastic Markov processes, 
including both the system and the memory. This defines 
a four-dimensional configuration space in which each 
step can be represented by a simple 4x4 matrix. In the 
following, we discuss each of the three steps in detail: 


(system state, memory bit) = {00, 01,10,11}. (4) 

Since the memory is inactive during relaxation, the time 
evolution operator Cr for relaxation represented in this 
basis reads 



System Memory 


/ 

k+ 

0 

-k_ 

0 


0 

k+ 

0 

-k_ 


-k+ 

0 

fc_ 

0 

V 

0 

-k+ 

0 

fc_ 


After the relaxation time tpi, the corresponding transition 
matrix is given by 


TnitR) = expi-CRtji) . 


( 6 ) 
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In the infinite-time relaxation limit > oo), this tran¬ 
sition matrix reduces to 


Tr = lim TnitR) 


/q 0 q 0\ 

0 9 0 9 

U 0 9 0 

Vo 9 0 9/ 


(7) 


2. Measurement 


A perfect measurement would faithfully copy the system 
state to the memory, i.e., if to = 0,1 denotes the previ¬ 
ous memory state and s = 0,1 the actual system state, 
it would simply copy (s, to) i—)■ (s, s). However, it is well- 
known that such a perfect measurement is irreversible, 
leading to a diverging entropy production [T9l[20]. There¬ 
fore, one usually considers imperfect measurements 


(s, to) I— >■ (s, s) with prob. e = 1 — e 

(s, to) I— >■ (s, 1 — s) with prob. e 


with a small error probability 0 < e < 1/2. Using the 
basis Q this corresponds to the transition matrix 


/£ e 0 0\ 
e e 0 01 
0 0 e e 

Vo 0 e e/ 


( 8 ) 


Remarkably, this imperfect measurement process can be 
implemented by a stochastic Markov process as well, 
since = Tm- The corresponding time evolution oper¬ 
ator reads 


Cm — k 


/ e -e 0 0 \ 

I -e e 0 0 I 

0 0 e -e 

Vo 0 -e e / 


(9) 


with a rate k' and it is easy to show that the transition 
matrix 7m in Eq. Q is retrieved in the limit of infinite 
measurement time: 


Tm= lim TM(tM)= lim exp(-£MiM) ■ (10) 

—^OO —^OO 

Thus, we succeeded to implement the second step as a 
stochastic Markov process as well. 

If the memory is considered as being in contact with 
some heat bath of inverse temperature /3 during the 
stochastic measurement process, the time evolution de¬ 
fined above implies that the incorrectly measured state 
(s, 1 —s) has a higher energy than the correctly measured 
state (s, s), and that the corresponding energy difference 
between the two composite states is given by 

AE;'=/3-Mn^. (11) 


3. Feedback 


The purpose of the feedback is to use the information 
stored in the memory in order to extract energy from the 
system. If the preceding measurement was faithful, this 
would mean to perform the transitions 


00 I—00 without extraction of energy 
11 I—01 extracting the work Wf^x = AE. 

These transitions alone would be again irreversible, caus¬ 
ing an inhnite entropy production. However, if we add 
symmetric transitions in the (unlikely) case of erroneous 
measurements, namely, 

10 I—> 10 without performing work 

01 I—)• 11 performing work, i.e., W^x = —AE, 


we obtain the feedback transition matrix 
/I 0 0 0\ 

_ _ 0 0 0 1 

0 0 10 ' 

Vo 1 0 0 / 


( 12 ) 


Thus the feedback process is carried out in such a way 
that the system state is flipped (s i—1 — s) for to = 1 
while it remains unchanged (s i—> s) for to = 0. It 
is assumed that the feedback transition occurs instan¬ 
taneously so that the total time r of a complete cycle 
Tr 7m -t 7> is given hy t = tR + tn- 

Since Tp = 1, the feedback is fully reversible, hence 
it does not produce entropy in the environment. More¬ 
over, it is easy to see that it simply exchanges the second 
and the fourth component of a vector, and therefore it 
does not change the joint entropy of system and memory. 
However, as will be shown below, it generally changes the 
entropy of the subsystems. 

Due to its reversible nature, the feedback as defined 
above cannot be implemented as a stochastic Markov 
process. However, we would like to point out that it 
is even possible to implement the feedback physically so 
that the entire chain of steps is represented cleanly as 
a sequence of stochastic processes. This can be done 
by replacing two subsequent cycles Tr —>■ Tm Tf ^ 
Tr ^ Tm ^ Tf ^ Tr equivalently by Tr 7m 
TfTrTf — t TfTmTf — t Tr. Since Tp = TfTrTf 
and Tm = TfTmTf satisfy the stochasticity condition 
(T| = Tp, T^ = Tm), the whole sequence of steps can 
be implemented by stochastic processes, providing a safe 
ground for the calculation of entropy production, mutual 
information, work, and heat. Having verified that this 
description is fully equivalent, we nevertheless keep the 
explicit feedback for simplicity in the original form. 

Since the rates k and k' simply rescale tp and tM, we 
will set 

k = k' :=1 (13) 

throughout the paper. Thus, apart from tp and tM, the 
model is controlled by only two parameters, namely, the 
relaxation parameter q and the error probability e. 
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Relaxation 


Measui’ement 


Feedback 


R 


M 

1^*4 

F 

l^’o) 


1 


The corresponding reduced vectors for the system and 
the memory read 


\pr) 



Stochastic Markov processes 


Reversible mapping 


= Q (20) 


FIG. 3: Cycle of the engine. The boxes are represented by 
three 4x4 transition matrices Tr,Tm, and Tf- After many 
cycles, the probability distributions between the steps will 
become stationary. 




\lq + eq) 


IV. ENTROPY AND ENTROPY PRODUCTION 


III. STATIONARY STATE 


1. Shannon entropy 


If the information engine runs repeatedly through 
many cycles, the probability distributions between the 
three steps will become stationary. The corresponding 
stationary probability distributions |Po)) \Pi), and IP 2 ) 
are represented as four-component vectors 

|Pfc) = (P™, P°\ Pfe'°, P^T, (k = 0,1, 2) (14) 


Given the stationary probability distributions |Pfc), it 
is straightforward to compute the entropies of the sys¬ 
tem (s), the memory (to), and the joint system (sto) be¬ 
tween the steps in a stationary cycle, using the definition 
of the Shannon entropy 

P = -^P^lnP'=, (21) 


are determined by the equations 

TFTMTfl |Po) = |Po) 

TrTfTm\Pi) = \Pi) (15) 

TmTrTf\P 2 ) = IP2) 

with |Pi) = Tr|Po) and IP 2 ) = Ta-/|Pi) (see Fig.|3]). 

The reduced stationary probability vectors of the sys¬ 
tem (s) and the memory (to) are given, respectively, as 

\pi^^) = {p^^+p^\ pr+PIT 

|pM) ^ (pOO p poi p pn)T _ (ig) 


Clearly, the measurement does not modify the system 
state, meaning that |Pi*^) = IP^*^)- Similarly, both the 
relaxation and feedback do not affect the memory, hence 
\pT)^ = and \pT) = |Pq"*^). As a result, in 

a stationary situation, the information acquired during 
the measurement is statistically the same in each cycle, 
implying that \pT) = \pT). 

As a simple example, first consider the case of infinite¬ 
time relaxation and measurement (tR,tM -^ 00 ), where 
the matrices Pr,Pm, and Tf are given by Eqs. 0, 
and (12). In this case the normalized stationary proba¬ 
bility vectors turn out to be given by 


1 ^ 0 ) 

1 ^ 1 ) 


(eg, eg, eg, eq)^ 

((eg-beg)g, (eg-f eg)g, (eg-b eg)g. 


(17) 

(eg-beg)g)^ 

(18) 

(19) 


where the sum runs over the vector components. Because 
of the aforementioned coincidence of various probability 
vectors we have and hT = ■ 

Furthermore, the reversibility of the feedback process 
guarantees that . 

For Ir , tM —>■ 00 , the expressions for the Shannon en- 


tropies reduce to 



p(s) 

= T) 

+ 

h{e) 

p(s) 

^1,2 

= K<l) 

+ 

h{q) (22) 

ttT) 

^0,1,2 

= h{eq 

+ 

eg) + h{eq + eg), 

rrism) 

Po,2 

= Hq) 

+ 

h{q) + h{e) + h{e) (23) 

p-{sm) 

= Hq) 

+ 

h{q) -b h{eq + eg) -b /i(eg -b eg), 


where we used the notation h{p) := —phip. 

During the relaxation process, where the system tries 
to restore the equilibrium distribution from the overpop¬ 
ulated ground state after energy extraction, the system 
entropy is expected to increase, provided that the 
error probability e is sufficiently small (e < g). The same 
applies to the composite entropy P^®"*!. 

To summarize, the entropy changes during relaxation 
(R), measurement (M), and feedback (F) are given by 

APj)*^ > 0, AP|^^ = 0, APj;^ = -APjj"^ < 0 , 

APjf*^ = AP^^ = APjr^ = 0 , (24) 

> 0, AP^™^ = -AhT\ = 0 • 


IF 2 ) = (eg, eg, eg, eq)'^. 
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2. Mutual information 

With these expressions, it is straightforward to compute 
the mutual information 

4 = >0, (fc = 0,1, 2) (25) 

which is a measure for the correlation between system 
and memory. This correlation is expected to build up 
during the measurement and then to decrease during 
feedback and relaxation, implying the inequalities 

A/fl < 0, A/m >0, AIf <0 ■ (26) 

It is interesting to note that the change of the composite 
entropy is purely given by amount of mutual information 
acquired during the measurement, i.e. 

A//^™^ = -A/m and A//jj"™^ = A/m ■ (27) 

For t_R, tM —t oo, we have 

4 = h{eq + eq) + h(eq + eq) - h{q) - h{q) 
4=0 (28) 
4 = h{eq + eg) + h{eq + eg) - h(e) - h{e) . 

Note that the result 4 = 0 is true only if the relaxation 
time is infinite, which obviously destroys all correlations 
between system and memory. 


3. Entropy production 

Let us now turn to the entropy production. According to 
Schnakenberg [2TH23] , whenever the system or the mem¬ 
ory jumps spontaneously from the configuration c to an¬ 
other configuration c', the amount of entropy 

A//—,(4 = 10^^^^^ (29) 

is generated in the environment. Here, denotes 

the transition rate at time t. Therefore, the mean entropy 
production rate is given by 

A//— = ^ P^{t)w,^At) In . (30) 

At 44 Wc'^cit) 

In an arbitrary nonequilibrium system, one would have 
to solve the master equation, plug the solution into the 
equation above, and integrate the resulting expression 
over a certain window of time. However, in the present 
case this is not necessary since the model is so simple 
that the rates happen to obey detailed balance, dehned 
as Pg^^fWc^c' = Pstat'^c^c in the stationary equilib¬ 
rium state. In this case, it is therefore straightforward 
to rewrite the equation given above as 

A//— = 

c^c' 

rl 

= ( 31 ) 



FIG. 4: Entropy production according to Eq. ( |34[ ) during re¬ 
laxation (lower curves) and measurement (upper curves) for 
various values of e as a function of q. If the measurement 
is accurate enough (e < g < 1/2), the entropy production 
during relaxation becomes negative. 


allowing us to compute the average entropy production 
in each step directly without integration by means of 

A//^™ = E(Aw - In P!tat ■ (32) 


Here P^ij and Pfi^ai denote the initial and the Hnal 
probabilities for a finite time span while P^f^t Ide sta¬ 
tionary probability distribution that would emerge af¬ 
ter inhnite long time. For example, during relaxation, 
Pgf^^f can be obtained by taking oo in the expres¬ 

sion for P/ in Eq. (18). Similarly, during measurement, 
Pstat = limt„^oo Pi with P| given in Eq. ( [T9| ). 

With the above formula, we obtain the following ex¬ 
pressions for the entropy production in each process: 


Ai/g"" = (Pgio + Fo“ - Pi° - In - 

A//|,r = {P^^ + P^° - P^^ - P:^°)\n- (33) 

A//|,”'' = 0 . 


Note that these expressions hold for any Hnite tn and Im- 
The last result is obvious since the feedback with Tp =i 
is a reversible operation. 

For tp^tM —t oo, by inserting Eqs. ([T7|)-([T^ we get 
explicit expressions 

Ai/^™ = _(g_e)ln| (34) 

A//-" = 2gg(e-e)ln^, 

which are plotted for various error probabilities in Fig. 

As one can see, for e < g < 1/2 the entropy produc¬ 
tion during relaxation AH^'“ is negative, meaning that 
the engine imports entropy (heat) from the environment 
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rather than producing it. Obviously, this is the regime of 
interest where we would like to operate our information 
engine. However, as can be seen, the negative entropy 
production during relaxation is always overcompensated 
by a positive one during the measurement, which is con¬ 
sistent with the second law of thermodynamics. Notice 
that the entropy production during measurement is al¬ 
ways positive since e < 1/2. 


V. WORK EXTRACTION AND SUPPLY 


Relaxation 

R 

Measurement 

A M L * 

Feedback 

F 


E 1 \e 

in ■ W 


IP IP 

Work 

extraction 


By virtue of Clausius’ law dQ = TdH, the produced 
entropy can be translated directly into an amount of 
heat. In most studies, it is usually assumed that the 
temperatures of the system and the memory are iden¬ 
tical. However, for the sake of generality, let us allow 
the temperatures to be different, assigning /3 h = I/Tr 
during relaxation and /3 m = I/Tm during measurement, 
as sketched in Fig. Thus the respective heat contri¬ 
butions averaged over all possible stochastic trajectories 
are given by 

{Qr) = , (Qm) = . (35) 

Here we use the sign convention that heat flowing away 
from the engine into the environment has a positive sign, 
i.e. we expect (Qr) to be negative and (Qm) to be pos¬ 
itive. 

In order to maintain stationarity of the system af¬ 
ter one engine cycle, the average work {Wex) extracted 
during feedback should exactly balance the average heat 
{Qr) during relaxation, i.e. 


(Vbe.)=-(QR)>0. (36) 

Consequently, the measurement process does not change 
the energy of the system. This requires an additional 
influx of energy in the form of extra work {Wsup) into 
the memory which is necessary to compensate the loss of 
heat {Qm) flowing away to the environment during the 
measurement process: 


{Wsup) = {Qm) > 0. (37) 


FIG. 5: Balancing flow of heat and work during the engine 
cycle (see text). 


identities = Pq^ and P^^ = Pq° (flip s only when 
m = I), it is easy to show explicitly that 


{Wsx)=-Pr^AH^r^'^ = -{Qr). (39) 

The additional work {Wsup) supplied during the measure¬ 
ment process can be interpreted as the energy needed to 
operate the measurement device. Technically this con¬ 
tribution comes from the fact that the energy levels of 
the joint system are different during relaxation and mea¬ 
surement so that extra energy is needed to move them 
around. For example, this could be done by applying an 
external potential in order to make the energy level of 
the erroneous composite state (s, 1 — s) higher than that 
of the correctly measured state (s, s) by the amount of 
IS.E' = In I. When the external potential is turned 
on just before the measurement, the average energy of the 
composite of system and memory increases by {Ei„) and 
similarly it looses the energy {Eout) when the potential 
is turned off at the end of the measurement: 


{Ein) = 


fn - 
e 


Pm 


pOl 

{Eout)= ' 


Pm 


plO p 

Ini . 


(40) 

Comparing the difference {Wsup) = {Ein) — {Eout) with 
Eq. ([M| one can see immediately that 


The average net work performed by the machine, defined 
as the difference of extracted and supplied work, is there¬ 
fore given by 

{Wnet) = {Wex) - {Wsup) = -{Qr) - {Qm) ■ (38) 


{Wsup) = PmAH^m'^ = {Qm) . (41) 

In the limit Ir, tM —t oo, the average work contributions 
read 


Note that the net work can change its sign depending on 
the choice of the parameters g, e, tR, and Im- 


Let us first compute the extracted work. According 
to Sect. H.3 {Wex) is given by [P^^ — P2^)Ai3, where 
AP = Pjj^ In |, see (3). Using IP^*^) = |P 2 *^) (no system 
change during measurement) together with the feedback 


{Wex) = (42) 

Pr Q 

/TM \ _ - e) , e 

{Wsup) — n In . 

PM e 
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VI. THERMODYNAMIC SECOND LAWS AND 
FLUCTUATION THEOREMS 


The total entropy production of the whole setup during 
relaxation (R) and measurement (M) is given by 


+ ALT^™ (43) 

AiJl?‘ = A77^™^ + Ai7|^" . (44) 


According to Eq. (27), the entropy differences in the joint 
system are given solely in terms of the mutual informa¬ 
tion difference 


Ai7r = -AillT = A/m 


(45) 



while the entropy differences in the environment are 
given in terms of the transferred heat by = 

Pr,m{Qr,m)- Using Eqs. (39) and ([4l|) we arrive at 


Ai/‘j°‘ = +AlM-PR{We.) (46) 

A//m* = —AIm + hliWaup) ■ 


Eor the total system including the environment the sec¬ 
ond law of thermodynamics should be satisfied for each 
process, i.e., 


Ai/^°‘ > 0 and Ai/^J* > 0 , (47) 

or equivalently 

< AIm < Pm {W sup) ■ (48) 

Most existing studies are only interested in the first in¬ 
equality during relaxation, while the other one during 
measurement is ignored. The purpose of this work is to 
point out that there is a second inequality for the mea¬ 
surement process as well, and that the two inequalities 
are complementary with respect to each other. More 
specifically, if Pr = Pm, the extracted work is bounded 
from above by the mutual information while the work re¬ 
quired to operate the memory is bounded from below by 
the same threshold. This means that the setup cannot 
be used to gain work out of nothing, {Wnet) < 0, as ex¬ 
pected by the second law. However, if the two reservoir 
temperatures were different (Pn < Pm), the extracted 
work could be larger than the supplied one, in which 
case the system operates like a conventional heat engine 
(see Fig. [6]). 

It is almost trivial to construct the integral fluctua¬ 
tion theorems (IFT’s) through the standard approach of 
stochastic thermodynamics [23] by considering the heat 
along all possible trajectories in the composite configu¬ 
rational state space. With an appropriate definition of 
the Shannon entropy for a given trajectory |23j . one can 
easily get the fluctuation theorems for the total entropy 
production for each process as 

^ ^ ^ ) = 1 , ( 49 ) 


FIG. 6: Infinite-time relaxation and measurement. The ex¬ 
tracted work (Wex), supplied work {Wsup), and the mutual 
information A/m = I 2 are displayed as functions of q for a 
constant error probability e = 0.1. Here we choose /3 m = 1, 2, 
and 5 with Pr = 1. Note that Pm{Wsup) = A/Zm” is inde¬ 
pendent of Pm- {Wf,x) is positive at e < g < 1/2 as expected 
from Eq. (42). When the temperature of the measurement 


reservoir is sufficiently lower than that of the relaxation reser¬ 
voir, for example Pm = 5, the extracted work pW^x) can be 
larger than the supplied work pWsup), as indicated by the 


The second laws in Eq. ( |47[ ) are simple consequences of 
the IFT’s with A////m = \A//)jm( traj.)). It is rather 
tricky to find the IFT’s in terms of work and mutual in¬ 
formation, because it requires an equilibrium state as an 
initial condition. This is the case only when tn becomes 
infinite so that the system is in equilibrium at the start 
of the measurement as well as at the beginning of the 
feedback. However, note that the bounds for works in 
Eq. (48) are valid even if tn is finite. 


VH. FINITE-TIME RELAXATION AND 
MEASUREMENT 


In practice, an engine is only useful if the cycle time 
T is finite. Thus, it is obviously of interest to derive all 
physical quantities as function of the cycle time. This 
allows one to find the optimum for maximal power gen¬ 
eration, as will be discussed in the next section. 

It is straightforward to obtain the transition matrices 
for relaxation and measurement for finite time spans Ir 
and Im'- 


TRptR) — 


TMptM) = 


f 7^ T T^q 

0 

TZq 

0 

\ 

0 

7^ -|- T^q 

0 

TZq 

TZq 

0 

TZ -\- TZq 

0 


\ 0 

TZq 

0 

7^ -|- j 

fM + Ml 

Ml 

0 


0 

Me 

M+M 

e 0 


0 


(50) 


0 0 M+Me Me 

0 0 Ml M+MlJ 
















FIG. 7: Finite-time relaxation and measurement. The ex¬ 
tracted work (H4a;), supplied work {Wsup), and the mutual 
information I 2 and A/Af are shown as functions of Here, 
we choose ta = tM, q = 0.2, e = 0.1, Pr — 1, and Pm = 5. 


where 


7^ := e-‘« 
M ■- e"*" 


and TZ := 1 — TZ 
and M := 1 — M . 


(51) 

(52) 


Note that the transition probability from (s, 1 — s) to 
(s, s) during measurement. Me, is smaller than e, which 
means that the measurement for finite Im is less accurate 
than in the limit of infinite time. 


Solving Eq. (15), one can find explicit but complicated 
expressions for all stationary distributions such as \Po), 

I Pi), and IP 2 ) for finite Ir and Im, which are not shown 
here explicitly. The heat dissipation during the finite¬ 
time relaxation and measurement can be obtained from 
Eq. (33) while the extracted work and the supplied work 
are given by Eqs. (36) and (37). 

Using the relation IP 2 ) = 7m|Pi) we find that 


(VF,„p) = In - . 

PM e 

Here £ = P°^ -I- Pl^ is explicitly given by 

iZ{eq + eq) + a (jZq -(- TZMe) 


£{tR,tM) = 


1-aTZM 


(53) 


(54) 


with a = TZ + TZ{q — q){e — e). In a similar manner, we 
obtain (Wex) as the function of £\ 




q-e-M{£ 

Pr 



(55) 


In the limit of Ir, Im 
recover Eq. (42). 


00 {TZ, —)■ 0), we consistently 


Note that the finite-time works in ( [^ and de¬ 
crease monotonously with TZ and M (see Fig. [^. More¬ 
over, since the correlation between system and memory 


builds up continuously during the measurement process, 
it is obvious that A/m decreases with M, remaining posi¬ 
tive by definition. The positivity of A/m guarantees that 
{Wsup) is also positive. On the other hand, {Wex) can 
be negative for short-time measurement and relaxation, 
as its upper bound A/m approaches zero for Im —>■ 0. 

The monotonous dependence shown in Fig. [^suggests 
that {Wex) becomes maximal for maximal measurement 
accuracy {t^j —> 00 ) and full relaxation {Ir -p 00 ) in or¬ 
der to redistribute and pump the overpopulated ground 
state s = 0 back to the energetically excited state s = 1. 
Therefore, both limit tM,tR —>■ 00 have to be carried out 
simultaneously. To establish this combined limit conve¬ 
niently, let us from now on set 

tR = tM = tI2 , (56) 

meaning that TZ = M = With this convention we 

expect {Wex) to be maximal in the limit of infinite cycle 
time (r —>• 00 ). Moreover, as r decreases, we expect 
{Wex) to decrease and eventually to become negative. 

If (Wex) > {Wsup), the system operates like a conven¬ 
tional heat engine. For infinite r the net work {Wnet) is 
maximal, but the power (net work per unit time) van¬ 
ishes. For finite but sufficiently large r and properly 
chosen parameters the system still produces a positive 
net work, hence the power is positive. However, as can 
be seen in Fig. the curves for {Wex) and {Wsup) cross 
each other at some finite cycle time t = Ts- At this point 
we do no longer obtain any net work from the engine, 
hence the power vanishes again. Consequently, there will 
be a particular cycle time in between, at which the power 
of the engine is maximal. In the next section, we will dis¬ 
cuss this aspect in more detail. 


VIII. EFFICIENCY 


Let us now assume that the information engine oper¬ 
ates in a regime where the net work is positive. In this 
case the whole setup can be interpreted as a conventional 
heat engine, as sketched in Fig. 0 As Pr < Pm, the up¬ 
per reservoir for the relaxation process plays a role of a 
high-temperature heat source, while the lower reservoir 
in contact with the memory device acts as a heat sink. 
The ejficiency of this heat engine in a single cycle is de¬ 
fined in the usual way as 


_ {Wnet) _ ^ {Wsup) 
\{Qr)\ {Wex) ■ 


(57) 


Using Eqs. (53) and (55) the efficiency can be rewritten 


as 


where 


A(r) = 


v{t) = 1 - 


{£ — e)M In (e/e) 
[q-e-M{£ - e)]1Z\n{q/q)' 


(58) 


(59) 
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FIG. 8: Interpretation of the information engine as a conven¬ 
tional heat engine. Heat flows from the heat reservoir 

at Pr (high temperature) into the engine which produces the 
work gain {Wnet) = {Wex) — (H4up). The remaining heat 
{Qm) is transferred to another reservoir at Pm (low temper¬ 
ature). 


Note that the relaxation and measurement processes are 
not quasi-static so that even in the limit r —>■ oo the en¬ 
gine never reaches the Carnot efficiency ? 7 c = 1 — 
Instead, we find that the efficiency is limited by a differ¬ 
ent upper bound rjmax which can be computed as follows. 
According to the monotonicity arguments discussed in 
the preceding section, ? 7 (r) is expected to become maxi¬ 
mal in the limit r —> oo. This suggests that 


ri{T) < Tlmax = hm ??(r) 

T—¥00 


1 A 


where 


Aoo = lim A(r) 

T—^OO 


2qq{e - e) In (e/e) 
(q-e) In {q/q) 


(60) 


(61) 


Fig.|^shows 1/Aoo as function of q for several values of e. 
It is obvious that 1/Aoo is positive for e < g < 1/2 with 
a unimodal shape due to the similar behavior of {W^x) 
demonstrated in Fig. As e —?> 0, 1/Aoo approaches 
zero except for g « 0. In the limit of both q ^ 0 and 
e —>■ 0 , Aoo approaches a constant bounded from below 
by Aoo > 2. Consequently, in order to obtain a positive 
work gain, the difference of temperatures should be at 
least Pm/Pr > 2 for the infinite-time process. In short, 
we find that the efficiency of the information engine is 
bounded by 


Vmax — 


‘2Pr 

Pm 


< Vc ■ 


In conventional heat engines operating with a finite cy¬ 
cle time, thermodynamic processes are no longer quasi¬ 
static, leading to an efficiency below the Carnot bound 
rjc- In our case, we also find that 77 ( 1 ") becomes maximal 
in the limit r —>■ 00 . However, in contrast to the Carnot 
limit, where the entropy production vanishes, the entropy 
production per cycle in our model AT?)/* -|- be¬ 

comes also maximal at r = 00 . This is one of the crucial 



0 0.1 0.2 0.3 0.4 0.5 

q 

FIG. 9: 1/Acxd according to Eq. ( |61| l as function of q for several 
values of e. In the infinity-time limit, {W^x) and 1/Aoo are 
positive only for e < q < 1/2. Note that, as e goes to zero, 
Aoo approaches to 2, the minimum, at g —> 0. 


features of our information engine which is totally dis¬ 
tinct from conventional heat engines. Nevertheless, the 
entropy production rate (per unit time) decreases with 
increasing r and finally vanishes at r = 00 . Hence, one 
may also say that the maximum efficiency is found at the 
minimum entropy production rate. 

Similarly, the average power gain, (P) = /Wneti/T in 
fact vanishes for r —>■ 00 because {Wnet) remains finite 
in this limit. For a realistic engine, we usually want to 
optimize the power gain, trading off the efficiency against 
the cycle time. As expected, (P) is maximized at some 
finite time. Top between Ts and 00 as shown in Fig. 10 (a). 

The efficiency of heat engines at maximal power has 
been studied previously in Refs. [53H2S]- Especially, for 
the Curzon-Ahlborn (CA) endoreversible model [23], it is 
well-known that the efficiency TycA at th e op timal power 
is given by rycA = I - VI - ??c- In Fig 
efficiency at optimal power ijop = = 

Eq. (58) as a function of rimax instead of rjc- It turns out 
that the functional behavior of rjop is completely different 
from rycA- 

In more general situations, it has been found that the 
efficiency at the maximum power obeys a universal form, 
Vop = ?7c/2 + Olrffi) for small rjc, when the engine and 
heat baths are strongly coupled (25] [211 . Our information 
engine exhibits a completely distinct behavior even for 
small rjmax- As seen in Fig. II rjop is more or less the 
same as rjmax in this regime. 

In order to investigate this unusual behavior in more 
detail, we now examine r]op analytically for small rjmax- 
As the stall time and thus Top (> t^) becomes large, a 
small TZ expansi on m ay be valid for small rjmax ■ Expand¬ 
ing rj{T) in Eq. (58) up to the linear order oiTZ = 


. [n we plot the 
r/V according to 
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FIG. 10: (a) The efficiency rj and power gain (P) as functions 
of e~'^. (b) 6“’"'' and 6“’’°*’ as functions of rjmax- In (b), the 
dotted line ~ is obtained from Eq. (641 and the dashed 


line corresponds to Eq. ( [70^ . We choose Ir = Im, q = 0.2, 
e = 0.1, and Pr = 1 for both (a) and (b). We set Pm = 5 for 
(a) and vary Pm for (b). 


we get 


Vi'^) « riTnax - 




£o - e 


= -r/2 


where we have used £ « 5o + £iTi- with 
£o = 299 (e - e) + e 


(62) 

(63) 


= (<7-e) - (9-9)(e-£)(<?-e) - 29g(e-e). 

We also checked that the linear coefficient inside the 


parentheses in Eq. (62) is always numerically positive in 
the interval e < q <1/2. 


As the stall time is defined by ri{Ts) = 0, Eq. (62) 
immediately gives us 


,2 

'max f 


e « As ry; 

where the coefficient is given by 


As = 


£1 £0 ~ e 

£0 - e q - e 


-2 


(64) 


(65) 


FIG. 11: The efficiency ryop at the optimal power for various 
e and q. Here we choose k = k' and Ir = Im for simplicity. 


It turns out that the scaling behavior in Eq. (64) extends 


quite well to finite rjmax, as can be seen in Fig. 
The optimal time Top should satisfy 


10 (b). 


d{P) 


dr 


= 0, 


which is rewritten as 

{WnetiTlop)) _ d{Wnet{n)) 


' op 


dr 


( 66 ) 


where TZop = As Top > Ts, TZop should be also 

small. This allows us to expand the above equation for 
small TZop, yielding 


/3m 


1 - ^ 


£0 - e 


2 \£o — e 9 — e 


Hop, (67) 


where £0 and £1 are the same as in Eq. (63), and \op is 
given by 


Aop - Aqo T Aqo 


£1 


£0 - e 


£0 - e q-e 


TZop- 


( 68 ) 


Plugging the above expression for Aop into Eq. (67), we 
get 


A-^l'^TZop= 


r]o 


1 + 7^ l-ln7^op’ 


which yields 

« As 

where 


rjmax J [1 + f{—ln-rjmax)Y 


(69) 


, (70) 


fix) = - 

X 


. 1 I 1 .. la a; 

In a; + 1-In Ag H- 

2 X 


(71) 
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Finally, inserting Eq. (70) into Eq. (62), it is straightfor¬ 
ward to find an approximate expression for rjop'. 


Vop ~ ( 1 I , I n I -f/ 1 M ) (1'2) 

\ I 1^ ^maa: I [1 “t” /( 

Therefore, in the limit of rimax 0, one can see rjop « 

Vmax, not ^rjmax, and that the next correction is logarith¬ 

mic and therefore quite slow. This calculation confirms 
that the linear irreversible thermodynamics slightly out 
of equilibrium in |25j should not be applicable in our case, 
simply because our processes are far from equilibrium. 
Indeed, in our case, the entropy production is maximal 
in the limit of rjmax 0. In future studies, the valid¬ 
ity of rjop « r]max should be addressed in the context of 
universality for general information engines showing the 
maximum efficiency at the same point where the entropy 
production is maximal. 


IX. CONCLUSIONS 


In this work, we have studied a simple example of 
an information engine which can be realized physically 
in terms of stochastic Markov processes. In agreement 
with previous studies, we find that the information feed¬ 
back allows one to extract work in a situation where this 
would be thermodynamically impossible without feed¬ 
back. Moreover, we confirm that total entropy produc¬ 
tion during relaxation obeys a fluctuation theorem, im¬ 
plying that the extracted work is bounded from above by 
the mutual information gain between memory and sys¬ 
tem. 

Providing a physical realization of the memory and 
the feedback loop, we have shown that the measurement 
process exhibits similar properties which are opposite in 
character. In particular, the entropy production during 
measurement is found to obey a fluctuation theorem as 
well. This implies that the measurement process itself 


costs energy, and that this additional energy supply is 
bounded from below by the same mutual information 
gain. Putting these pieces together, it is no surprise 
that the total setup consisting of system and memory 
satisfies the conventional second law of thermodynamics. 
Thus, we have shown that the thermodynamic second 
law, which is required to hold for the entire system dur¬ 
ing any finite process, leads to a duality in the properties 
of system and memory in this kind of information en¬ 
gines. 

For simplicity, we have presented most of our analytic 
results in the limit of infinite measurement- and relax¬ 
ation time. However, the extension to finite times is 
straightforward. At the end of the paper, we have ex¬ 
plicitly described some numerical results for finite-time 
measurement and relaxation. As in conventional heat 
engines the efficiency of the information engine is maxi¬ 
mized when the cycle time becomes infinite. However, in 
contrast to conventional heat engines, the entropy pro¬ 
duction is also maximal in this limit. On the other hand, 
we have demonstrated that the power gain acquires its 
maximum at a finite cycle time. We have also discussed 
the relation between the maximal efficiency and the effi¬ 
ciency at the operating point of maximal power. 

The striking differences between our model and con¬ 
ventional reversible heat engines can be traced back to 
the fact that our setup operates under non-equilibrium 
conditions. It would be interesting to investigate to what 
extent our observations can be explained in a universal 
framework. 
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